Overview

Dataset statistics

Number of variables: 39
Number of observations: 135343
Missing cells: 438786
Missing cells (%): 8.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 40.3 MiB
Average record size in memory: 312.0 B
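
The derived figures above follow from the raw counts. The sketch below is only an arithmetic check on the numbers reported in this overview; the variable names are illustrative and are not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 39
n_observations = 135343
missing_cells = 438786
avg_record_bytes = 312.0

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of all cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# Total memory implied by the average record size (1 MiB = 2**20 bytes).
total_mib = avg_record_bytes * n_observations / 2**20

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size: {total_mib:.1f} MiB")
```

The results agree with the reported 8.3% missing cells and 40.3 MiB total size after rounding.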

Variable types

CAT: 20
NUM: 18
BOOL: 1

Reproduction

Analysis started: 2020-05-28 20:16:57.565912
Analysis finished: 2020-05-28 20:23:31.843316
Duration: 6 minutes and 34.28 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
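
The reported duration can be recomputed from the two timestamps. This is a small sketch using only the Python standard library; the format string is an assumption based on how the timestamps are printed above.

```python
from datetime import datetime

# Timestamp layout assumed from the values shown in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2020-05-28 20:16:57.565912", FMT)
finished = datetime.strptime("2020-05-28 20:23:31.843316", FMT)

# Difference between the two timestamps, split into minutes and seconds.
elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)

print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```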

Warnings

CloseDate has a high cardinality: 5066 distinct values High cardinality
ElementarySchoolName has a high cardinality: 183 distinct values High cardinality
HighSchoolName has a high cardinality: 78 distinct values High cardinality
MiddleSchoolName has a high cardinality: 88 distinct values High cardinality
StreetName has a high cardinality: 15169 distinct values High cardinality
StreetNumber has a high cardinality: 10356 distinct values High cardinality
ArchitecturalStyle has a high cardinality: 264 distinct values High cardinality
TaxLegalDescription has a high cardinality: 61882 distinct values High cardinality
CurrentPrice is highly correlated with ClosePrice and 1 other fields High correlation
ClosePrice is highly correlated with CurrentPrice and 1 other fields High correlation
ListPrice is highly correlated with ClosePrice and 1 other fields High correlation
City is highly correlated with PostalCode High correlation
PostalCode is highly correlated with City High correlation
SchoolDistrict is highly correlated with HighSchoolName and 2 other fields High correlation
HighSchoolName is highly correlated with SchoolDistrict High correlation
MiddleSchoolName is highly correlated with SchoolDistrict High correlation
SeniorHighSchoolName is highly correlated with SchoolDistrict High correlation
StreetDirSuffix is highly correlated with StreetDirPrefix High correlation
StreetDirPrefix is highly correlated with StreetDirSuffix High correlation
HighSchoolName has 1546 (1.1%) missing values Missing
Occupancy has 24961 (18.4%) missing values Missing
SchoolDistrict has 7915 (5.8%) missing values Missing
SeniorHighSchoolName has 85101 (62.9%) missing values Missing
StreetDirPrefix has 131949 (97.5%) missing values Missing
StreetDirSuffix has 134859 (99.6%) missing values Missing
StreetSuffix has 13676 (10.1%) missing values Missing
ArchitecturalStyle has 10952 (8.1%) missing values Missing
TaxLegalDescription has 25819 (19.1%) missing values Missing
OriginalListPrice is highly skewed (γ1 = 241.4671172) Skewed
RATIO_ClosePrice_By_ListPrice is highly skewed (γ1 = 367.8874324) Skewed
RATIO_ClosePrice_By_OriginalListPrice is highly skewed (γ1 = 141.5756303) Skewed
RATIO_CurrentPrice_By_SQFT is highly skewed (γ1 = 89.55370868) Skewed
YearBuilt is highly skewed (γ1 = 97.18739673) Skewed
df_index has unique values Unique
DOM has 1820 (1.3%) zeros Zeros
ParkingSpacesGarage has 4190 (3.1%) zeros Zeros
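
The skewness warnings above report γ1 values in the tens or hundreds. The sketch below illustrates how a single extreme value can dominate γ1. It uses the plain moment coefficient of skewness (third central moment divided by the variance raised to 1.5); the report may use a bias-corrected estimator, so this is an illustration and not a reproduction of its numbers. The sample data are hypothetical.

```python
def skewness(values):
    """Moment coefficient of skewness: m3 / m2 ** 1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical, perfectly symmetric sample of construction years.
years = [1990, 1995, 2000, 2005, 2010]

# The same sample with one extreme value appended, similar to the
# maximum of 9999 reported for YearBuilt.
with_extreme = years + [9999]

print(skewness(years))         # symmetric sample: no skew
print(skewness(with_extreme))  # one extreme value: strong positive skew
```

Values such as 0 and 9999 in YearBuilt, or ratios in the hundreds of thousands, are worth inspecting before the skewness is taken as a property of the underlying distribution.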

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct count: 135343
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 107845.69582468248
Minimum: 0
Maximum: 213178
Zeros: 1
Zeros (%): < 0.1%
Memory size: 1.0 MiB

Quantile statistics

Minimum: 0
5-th percentile: 11367.1
Q1: 54918.5
Median: 108414
Q3: 161624.5
95-th percentile: 203138.9
Maximum: 213178
Range: 213178
Interquartile range (IQR): 106706

Descriptive statistics

Standard deviation: 61310.8563
Coefficient of variation (CV): 0.5685053616
Kurtosis: -1.196749337
Mean: 107845.6958
Median Absolute Deviation (MAD): 53346
Skewness: -0.02478099529
Sum: 1.459616001e+10
Variance: 3759021100
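
Several of the statistics above are simple functions of the others. As a consistency check (a sketch that only reuses the df_index figures reported here; the variable names are illustrative):

```python
# Figures copied from the df_index statistics above.
mean = 107845.6958
std = 61310.8563
q1, q3 = 54918.5, 161624.5
minimum, maximum = 0, 213178

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV: {cv:.4f}")
print(f"variance: {variance:.4e}")
print(f"IQR: {iqr}")
print(f"range: {value_range}")
```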
Histogram with fixed size bins (bins=10)
Value                    Count    Frequency (%)
8188                     1        < 0.1%
150499                   1        < 0.1%
15277                    1        < 0.1%
9134                     1        < 0.1%
11183                    1        < 0.1%
54192                    1        < 0.1%
56241                    1        < 0.1%
52147                    1        < 0.1%
39865                    1        < 0.1%
33722                    1        < 0.1%
Other values (135333)    135333   > 99.9%

Value     Count   Frequency (%)
0         1       < 0.1%
1         1       < 0.1%
2         1       < 0.1%
3         1       < 0.1%
7         1       < 0.1%

Value     Count   Frequency (%)
213178    1       < 0.1%
213177    1       < 0.1%
213176    1       < 0.1%
213173    1       < 0.1%
213172    1       < 0.1%

PostalCode
Categorical

HIGH CORRELATION

Distinct count: 23
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 MiB

Value                Count    Frequency (%)
75070                27519    20.3%
75035                16757    12.4%
75025                13261    9.8%
75023                12376    9.1%
75071                11840    8.7%
75093                11340    8.4%
75074                8456     6.2%
75075                8268     6.1%
75069                6628     4.9%
75024                6551     4.8%
Other values (13)    12347    9.1%

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

BathsTotal
Real number (ℝ≥0)

Distinct count: 47
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.5545628514219425
Minimum: 0.0
Maximum: 9.3
Zeros: 24
Zeros (%): < 0.1%
Memory size: 1.0 MiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 2
Median: 2.1
Q3: 3.1
95-th percentile: 4.1
Maximum: 9.3
Range: 9.3
Interquartile range (IQR): 1.1

Descriptive statistics

Standard deviation: 0.8042001564
Coefficient of variation (CV): 0.3148093052
Kurtosis: 1.975692569
Mean: 2.554562851
Median Absolute Deviation (MAD): 0.1
Skewness: 1.206939653
Sum: 345742.2
Variance: 0.6467378915
Histogram with fixed size bins (bins=10)

Value                Count    Frequency (%)
2                    49365    36.5%
2.1                  29236    21.6%
3.1                  18349    13.6%
3                    17915    13.2%
4                    8056     6.0%
4.1                  5264     3.9%
1                    1712     1.3%
1.1                  1464     1.1%
5.1                  987      0.7%
3.2                  683      0.5%
Other values (37)    2312     1.7%

Value    Count    Frequency (%)
0        24       < 0.1%
0.1      2        < 0.1%
0.2      4        < 0.1%
1        1712     1.3%
1.1      1464     1.1%

Value    Count    Frequency (%)
9.3      2        < 0.1%
9.2      1        < 0.1%
8.4      1        < 0.1%
8.3      1        < 0.1%
8.2      1        < 0.1%

BedsTotal
Real number (ℝ≥0)

Distinct count: 11
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.681261683278781
Minimum: 0.0
Maximum: 42.0
Zeros: 26
Zeros (%): < 0.1%
Memory size: 1.0 MiB

Quantile statistics

Minimum: 0
5-th percentile: 3
Q1: 3
Median: 4
Q3: 4
95-th percentile: 5
Maximum: 42
Range: 42
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.7924186912
Coefficient of variation (CV): 0.2152573654
Kurtosis: 80.91174898
Mean: 3.681261683
Median Absolute Deviation (MAD): 1
Skewness: 1.741801345
Sum: 498233
Variance: 0.6279273822
Histogram with fixed size bins (bins=10)

Value    Count    Frequency (%)
4        62160    45.9%
3        49881    36.9%
5        16597    12.3%
2        5364     4.0%
6        889      0.7%
1        361      0.3%
7        49       < 0.1%
0        26       < 0.1%
8        11       < 0.1%
9        3        < 0.1%

Value    Count    Frequency (%)
0        26       < 0.1%
1        361      0.3%
2        5364     4.0%
3        49881    36.9%
4        62160    45.9%

Value    Count    Frequency (%)
42       2        < 0.1%
9        3        < 0.1%
8        11       < 0.1%
7        49       < 0.1%
6        889      0.7%

City
Categorical

HIGH CORRELATION

Distinct count: 5
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 MiB

Value       Count    Frequency (%)
Plano       60709    44.9%
McKinney    44975    33.2%
Frisco      23766    17.6%
Prosper     3365     2.5%
Fairview    2528     1.9%

Length

Max length: 8
Median length: 6
Mean length: 6.278270764
Min length: 5

CloseDate
Categorical

HIGH CARDINALITY

Distinct count: 5066
Unique (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 MiB

Value                  Count     Frequency (%)
4/29/2005              176       0.1%
4/28/2005              160       0.1%
8/28/2009              158       0.1%
2/27/2015              150       0.1%
4/30/2009              148       0.1%
6/30/2005              144       0.1%
3/31/2005              138       0.1%
5/29/2009              128       0.1%
4/15/2005              128       0.1%
6/26/2009              128       0.1%
Other values (5056)    133885    98.9%

Length

Max length: 10
Median length: 9
Mean length: 8.991776449
Min length: 8

ClosePrice
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count9996
Unique (%)7.4%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean282111.8280486471
Minimum70.0
Maximum6547500.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum70
5-th percentile118726
Q1171000
median240000
Q3331000
95-th percentile579692.75
Maximum6547500
Range6547430
Interquartile range (IQR)160000

Descriptive statistics

Standard deviation192542.8604
Coefficient of variation (CV)0.6825054507
Kurtosis57.30568401
Mean282111.828
Median Absolute Deviation (MAD)75100
Skewness5.026583132
Sum3.818157903e+10
Variance3.707275308e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25000010960.8%
 
2250009750.7%
 
2750009310.7%
 
3000008830.7%
 
1650008680.6%
 
2650008570.6%
 
2100008530.6%
 
2600008510.6%
 
1750008450.6%
 
2150008440.6%
 
Other values (9986)12633993.3%
 
ValueCountFrequency (%) 
701< 0.1%
 
811< 0.1%
 
1101< 0.1%
 
1651< 0.1%
 
2201< 0.1%
 
ValueCountFrequency (%) 
65475001< 0.1%
 
57200001< 0.1%
 
51500001< 0.1%
 
49000001< 0.1%
 
42750001< 0.1%
 

CurrentPrice
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count9996
Unique (%)7.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean282118.57304596464
Minimum70.0
Maximum6547500.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum70
5-th percentile118726
Q1171000
median240000
Q3331000
95-th percentile579880
Maximum6547500
Range6547430
Interquartile range (IQR)160000

Descriptive statistics

Standard deviation192558.1382
Coefficient of variation (CV)0.6825432871
Kurtosis57.28913019
Mean282118.573
Median Absolute Deviation (MAD)75100
Skewness5.026031861
Sum3.818277403e+10
Variance3.707863659e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25000010960.8%
 
2250009750.7%
 
2750009310.7%
 
3000008830.7%
 
1650008680.6%
 
2650008570.6%
 
2100008530.6%
 
2600008510.6%
 
1750008450.6%
 
2150008440.6%
 
Other values (9986)12634093.3%
 
ValueCountFrequency (%) 
701< 0.1%
 
811< 0.1%
 
1101< 0.1%
 
1651< 0.1%
 
2201< 0.1%
 
ValueCountFrequency (%) 
65475001< 0.1%
 
57200001< 0.1%
 
51500001< 0.1%
 
49000001< 0.1%
 
42750001< 0.1%
 

DOM
Real number (ℝ≥0)

ZEROS

Distinct count572
Unique (%)0.4%
Missing23
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean48.98737806680461
Minimum0.0
Maximum1649.0
Zeros1820
Zeros (%)1.3%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile2
Q19
median28
Q367
95-th percentile163
Maximum1649
Range1649
Interquartile range (IQR)58

Descriptive statistics

Standard deviation60.37183771
Coefficient of variation (CV)1.232395774
Kurtosis25.05132464
Mean48.98737807
Median Absolute Deviation (MAD)22
Skewness3.257590844
Sum6628972
Variance3644.758788
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
353033.9%
 
450783.8%
 
543533.2%
 
239032.9%
 
635102.6%
 
732452.4%
 
1027802.1%
 
827192.0%
 
1124511.8%
 
923891.8%
 
Other values (562)9958973.6%
 
ValueCountFrequency (%) 
018201.3%
 
118581.4%
 
239032.9%
 
353033.9%
 
450783.8%
 
ValueCountFrequency (%) 
16492< 0.1%
 
13241< 0.1%
 
12431< 0.1%
 
10911< 0.1%
 
10661< 0.1%
 

ElementarySchoolName
Categorical

HIGH CARDINALITY

Distinct count183
Unique (%)0.1%
Missing218
Missing (%)0.2%
Memory size1.0 MiB
Christie
 
5264
Thomas
 
2852
Glenoaks
 
2731
Johnson
 
2721
Curtsinger
 
2551
Other values (178)
119006
ValueCountFrequency (%) 
Christie52643.9%
 
Thomas28522.1%
 
Glenoaks27312.0%
 
Johnson27212.0%
 
Curtsinger25511.9%
 
Gunstream25131.9%
 
Bennett24831.8%
 
Shawnee23951.8%
 
Wolford23101.7%
 
Spears23031.7%
 
Other values (173)10700279.1%
 

Length

Max length27
Median length7
Mean length7.606185765
Min length3

HighSchoolName
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING

Distinct count78
Unique (%)0.1%
Missing1546
Missing (%)1.1%
Memory size1.0 MiB
Mckinney
 
14001
Jasper
 
13073
Clark
 
12368
Centennial
 
11922
Vines
 
11128
Other values (73)
71305
ValueCountFrequency (%) 
Mckinney1400110.3%
 
Jasper130739.7%
 
Clark123689.1%
 
Centennial119228.8%
 
Vines111288.2%
 
Mckinney Boyd107868.0%
 
Shepton85446.3%
 
Frisco84266.2%
 
Mckinneyno83156.1%
 
Williams73005.4%
 
Other values (68)2793420.6%
 

Length

Max length20
Median length7
Mean length7.675565046
Min length3

AssociationType
Categorical

Distinct count3
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size1.0 MiB
Mandatory
86564
None
40966
Voluntary
 
7811
ValueCountFrequency (%) 
Mandatory8656464.0%
 
None4096630.3%
 
Voluntary78115.8%
 
(Missing)2< 0.1%
 

Length

Max length9
Median length9
Mean length7.48649727
Min length3

ListPrice
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count6107
Unique (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean288127.66547955933
Minimum1.0
Maximum6750000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum1
5-th percentile120000
Q1174900
median244000
Q3339000
95-th percentile597860
Maximum6750000
Range6749999
Interquartile range (IQR)164100

Descriptive statistics

Standard deviation203843.7197
Coefficient of variation (CV)0.7074770808
Kurtosis69.75373719
Mean288127.6655
Median Absolute Deviation (MAD)76500
Skewness5.563886769
Sum3.899606263e+10
Variance4.155226205e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
16990016571.2%
 
17990014921.1%
 
15990014911.1%
 
19990014591.1%
 
24990014341.1%
 
18990013721.0%
 
14990013191.0%
 
29990012580.9%
 
13990011840.9%
 
22500011780.9%
 
Other values (6097)12149989.8%
 
ValueCountFrequency (%) 
11< 0.1%
 
19951< 0.1%
 
99001< 0.1%
 
132002< 0.1%
 
135001< 0.1%
 
ValueCountFrequency (%) 
67500001< 0.1%
 
63990002< 0.1%
 
54000001< 0.1%
 
51500001< 0.1%
 
49000002< 0.1%
 

LotSize
Categorical

Distinct count10
Unique (%)< 0.1%
Missing14
Missing (%)< 0.1%
Memory size1.0 MiB
Less Than .5 Acre (not Zero)
119650
.5 Acre to .99 Acre
 
6369
Condo/Townhome Lot
 
3391
Zero Lot
 
2758
1 Acre to 2.99 Acres
 
2545
Other values (5)
 
616
ValueCountFrequency (%) 
Less Than .5 Acre (not Zero)11965088.4%
 
.5 Acre to .99 Acre63694.7%
 
Condo/Townhome Lot33912.5%
 
Zero Lot27582.0%
 
1 Acre to 2.99 Acres25451.9%
 
3 Acres to 4.99 Acres3150.2%
 
5 Acres to 9.99 Acres1620.1%
 
10 Acres to 49.99 Acres1280.1%
 
Over 100 Acres9< 0.1%
 
50 Acres to 100 Acres2< 0.1%
 
(Missing)14< 0.1%
 

Length

Max length28
Median length28
Mean length26.73491795
Min length3

MiddleSchoolName
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct count88
Unique (%)0.1%
Missing699
Missing (%)0.5%
Memory size1.0 MiB
Dowell
 
8740
Evans
 
7431
Carpenter
 
7279
Faubion
 
7171
Renner
 
6667
Other values (83)
97356
ValueCountFrequency (%) 
Dowell87406.5%
 
Evans74315.5%
 
Carpenter72795.4%
 
Faubion71715.3%
 
Renner66674.9%
 
Wester66544.9%
 
Clark66244.9%
 
Haggard63174.7%
 
Roach58634.3%
 
Robinson53053.9%
 
Other values (78)6659349.2%
 

Length

Max length21
Median length6
Mean length7.014821601
Min length3

MLSNumber
Real number (ℝ≥0)

Distinct count124901
Unique (%)92.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11896387.741227843
Minimum9308485.0
Maximum14320678.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum9308485
5-th percentile9714982.4
Q110758605
median11710551
Q313275723
95-th percentile14048025.1
Maximum14320678
Range5012193
Interquartile range (IQR)2517118

Descriptive statistics

Standard deviation1378452.483
Coefficient of variation (CV)0.115871516
Kurtosis-1.24648359
Mean11896387.74
Median Absolute Deviation (MAD)1309602
Skewness0.1485541609
Sum1.610092806e+12
Variance1.900131248e+12
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
102651203< 0.1%
 
102834633< 0.1%
 
102934843< 0.1%
 
102659043< 0.1%
 
102760053< 0.1%
 
102950893< 0.1%
 
102995663< 0.1%
 
102146093< 0.1%
 
102793493< 0.1%
 
102145783< 0.1%
 
Other values (124891)135313> 99.9%
 
ValueCountFrequency (%) 
93084851< 0.1%
 
93254341< 0.1%
 
93268791< 0.1%
 
93776601< 0.1%
 
93885821< 0.1%
 
ValueCountFrequency (%) 
143206781< 0.1%
 
143202201< 0.1%
 
143137271< 0.1%
 
143111131< 0.1%
 
143110281< 0.1%
 

NumberOfDiningAreas
Real number (ℝ≥0)

Distinct count8
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.7554030190407932
Minimum0.0
Maximum9.0
Zeros723
Zeros (%)0.5%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median2
Q32
95-th percentile2
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4499385172
Coefficient of variation (CV)0.2563163629
Kurtosis1.266541543
Mean1.755403019
Median Absolute Deviation (MAD)0
Skewness-1.181933479
Sum237578
Variance0.2024446693
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
210219975.5%
 
13205923.7%
 
07230.5%
 
33340.2%
 
419< 0.1%
 
54< 0.1%
 
72< 0.1%
 
91< 0.1%
 
(Missing)2< 0.1%
 
ValueCountFrequency (%) 
07230.5%
 
13205923.7%
 
210219975.5%
 
33340.2%
 
419< 0.1%
 
ValueCountFrequency (%) 
91< 0.1%
 
72< 0.1%
 
54< 0.1%
 
419< 0.1%
 
33340.2%
 

NumberOfLivingAreas
Real number (ℝ≥0)

Distinct count10
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2.067325737760636
Minimum0.0
Maximum9.0
Zeros68
Zeros (%)0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)2

Descriptive statistics

Standard deviation0.9108953261
Coefficient of variation (CV)0.4406152884
Kurtosis1.019823253
Mean2.067325738
Median Absolute Deviation (MAD)1
Skewness0.7456857934
Sum279796
Variance0.8297302952
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25601241.4%
 
13979829.4%
 
33161423.4%
 
465174.8%
 
510610.8%
 
61890.1%
 
0680.1%
 
752< 0.1%
 
818< 0.1%
 
913< 0.1%
 
(Missing)1< 0.1%
 
ValueCountFrequency (%) 
0680.1%
 
13979829.4%
 
25601241.4%
 
33161423.4%
 
465174.8%
 
ValueCountFrequency (%) 
913< 0.1%
 
818< 0.1%
 
752< 0.1%
 
61890.1%
 
510610.8%
 

NumberOfStories
Real number (ℝ≥0)

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5312576195296395
Minimum0.0
Maximum5.0
Zeros260
Zeros (%)0.2%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5119453135
Coefficient of variation (CV)0.3343299697
Kurtosis-1.55340794
Mean1.53125762
Median Absolute Deviation (MAD)0
Skewness-0.05709355423
Sum207245
Variance0.262088004
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
27097452.4%
 
16352446.9%
 
35700.4%
 
02600.2%
 
412< 0.1%
 
53< 0.1%
 
ValueCountFrequency (%) 
02600.2%
 
16352446.9%
 
27097452.4%
 
35700.4%
 
412< 0.1%
 
ValueCountFrequency (%) 
53< 0.1%
 
412< 0.1%
 
35700.4%
 
27097452.4%
 
16352446.9%
 

Occupancy
Categorical

MISSING

Distinct count3
Unique (%)< 0.1%
Missing24961
Missing (%)18.4%
Memory size1.0 MiB
Owner
74867
Vacant
32514
Tenant
 
3001
ValueCountFrequency (%) 
Owner7486755.3%
 
Vacant3251424.0%
 
Tenant30012.2%
 
(Missing)2496118.4%
 

Length

Max length6
Median length5
Mean length4.893551938
Min length3

OriginalListPrice
Real number (ℝ≥0)

SKEWED

Distinct count6110
Unique (%)4.5%
Missing6
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean302718.13430178
Minimum0.0
Maximum430000000.0
Zeros18
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile123972
Q1175000
median248000
Q3345000
95-th percentile612180
Maximum430000000
Range430000000
Interquartile range (IQR)170000

Descriptive statistics

Standard deviation1491181.774
Coefficient of variation (CV)4.925974381
Kurtosis63369.7939
Mean302718.1343
Median Absolute Deviation (MAD)78100
Skewness241.4671172
Sum4.096896414e+10
Variance2.223623084e+12
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
16990015321.1%
 
17990014531.1%
 
24990013611.0%
 
15990013441.0%
 
18990013261.0%
 
19990013241.0%
 
29990011950.9%
 
14990011920.9%
 
22500011750.9%
 
27500011110.8%
 
Other values (6100)12232490.4%
 
ValueCountFrequency (%) 
018< 0.1%
 
110< 0.1%
 
3201< 0.1%
 
10003< 0.1%
 
19901< 0.1%
 
ValueCountFrequency (%) 
4300000001< 0.1%
 
2999009001< 0.1%
 
1265000001< 0.1%
 
349900001< 0.1%
 
279900002< 0.1%
 

ParkingSpacesGarage
Real number (ℝ≥0)

ZEROS

Distinct count10
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2.121950318452513
Minimum0.0
Maximum9.0
Zeros4190
Zeros (%)3.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile2
Q12
median2
Q32
95-th percentile3
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6099141385
Coefficient of variation (CV)0.287430923
Kurtosis8.547737792
Mean2.121950318
Median Absolute Deviation (MAD)0
Skewness-0.003701813448
Sum287189
Variance0.3719952564
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
210331176.3%
 
32398517.7%
 
041903.1%
 
124521.8%
 
411030.8%
 
51600.1%
 
6890.1%
 
720< 0.1%
 
918< 0.1%
 
814< 0.1%
 
(Missing)1< 0.1%
 
ValueCountFrequency (%) 
041903.1%
 
124521.8%
 
210331176.3%
 
32398517.7%
 
411030.8%
 
ValueCountFrequency (%) 
918< 0.1%
 
814< 0.1%
 
720< 0.1%
 
6890.1%
 
51600.1%
 

PoolYN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Memory size1.0 MiB
False
102369
True
32973
(Missing)
 
1
ValueCountFrequency (%) 
False10236975.6%
 
True3297324.4%
 
(Missing)1< 0.1%
 

PropertySubType
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
RES-Single Family
128432
RES-Townhouse
 
4253
RES-Condo
 
1686
RES-Half Duplex
 
911
RES-Farm/Ranch
 
61
ValueCountFrequency (%) 
RES-Single Family12843294.9%
 
RES-Townhouse42533.1%
 
RES-Condo16861.2%
 
RES-Half Duplex9110.7%
 
RES-Farm/Ranch61< 0.1%
 

Length

Max length17
Median length17
Mean length16.75983243
Min length9

RATIO_ClosePrice_By_ListPrice
Real number (ℝ≥0)

SKEWED

Distinct count15559
Unique (%)11.5%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.7600840672518505
Minimum0.0005200000000000001
Maximum105000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum0.00052
5-th percentile0.92563
Q10.96674
median0.98508
Q31
95-th percentile1.0356
Maximum105000
Range104999.9995
Interquartile range (IQR)0.03326

Descriptive statistics

Standard deviation285.4101285
Coefficient of variation (CV)162.1571002
Kurtosis135341.4414
Mean1.760084067
Median Absolute Deviation (MAD)0.01498
Skewness367.8874324
Sum238213.2978
Variance81458.94144
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11904514.1%
 
0.980393110.2%
 
0.963080.2%
 
0.971432910.2%
 
0.975612750.2%
 
0.952382750.2%
 
0.977782730.2%
 
0.966672640.2%
 
0.967742620.2%
 
0.982470.2%
 
Other values (15549)11379184.1%
 
ValueCountFrequency (%) 
0.000521< 0.1%
 
0.000541< 0.1%
 
0.000951< 0.1%
 
0.000961< 0.1%
 
0.000971< 0.1%
 
ValueCountFrequency (%) 
1050001< 0.1%
 
150.375941< 0.1%
 
101< 0.1%
 
9.478671< 0.1%
 
7.547171< 0.1%
 

RATIO_ClosePrice_By_OriginalListPrice
Real number (ℝ≥0)

SKEWED

Distinct count20706
Unique (%)15.3%
Missing25
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean15.783095333067294
Minimum0.0005099999999999999
Maximum378000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum0.00051
5-th percentile0.86625
Q10.94275
median0.97397
Q31
95-th percentile1.0351015
Maximum378000
Range377999.9995
Interquartile range (IQR)0.05725

Descriptive statistics

Standard deviation1840.105945
Coefficient of variation (CV)116.58714
Kurtosis22401.75817
Mean15.78309533
Median Absolute Deviation (MAD)0.02603
Skewness141.5756303
Sum2135736.894
Variance3385989.888
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1134539.9%
 
0.952382980.2%
 
0.962880.2%
 
0.971432470.2%
 
0.966672350.2%
 
0.933332330.2%
 
0.952290.2%
 
0.967742250.2%
 
0.909092150.2%
 
0.977782110.2%
 
Other values (20696)11968488.4%
 
ValueCountFrequency (%) 
0.000511< 0.1%
 
0.000521< 0.1%
 
0.000921< 0.1%
 
0.000931< 0.1%
 
0.000941< 0.1%
 
ValueCountFrequency (%) 
3780001< 0.1%
 
2760001< 0.1%
 
2250001< 0.1%
 
2225001< 0.1%
 
1875001< 0.1%
 

RATIO_CurrentPrice_By_SQFT
Real number (ℝ≥0)

SKEWED

Distinct count15772
Unique (%)11.7%
Missing36
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean104.60618068540431
Minimum0.03
Maximum10000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum0.03
5-th percentile64.26
Q180.56
median96.7
Q3124.005
95-th percentile164.027
Maximum10000
Range9999.97
Interquartile range (IQR)43.445

Descriptive statistics

Standard deviation43.23548462
Coefficient of variation (CV)0.4133167308
Kurtosis20286.53607
Mean104.6061807
Median Absolute Deviation (MAD)19.69
Skewness89.55370868
Sum14153948.49
Variance1869.30713
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
83.33860.1%
 
100850.1%
 
12563< 0.1%
 
90.9160< 0.1%
 
111.1148< 0.1%
 
83.2947< 0.1%
 
85.7145< 0.1%
 
96.1544< 0.1%
 
76.9244< 0.1%
 
92.5943< 0.1%
 
Other values (15762)13474299.6%
 
ValueCountFrequency (%) 
0.031< 0.1%
 
0.041< 0.1%
 
0.071< 0.1%
 
0.082< 0.1%
 
0.11< 0.1%
 
ValueCountFrequency (%) 
100001< 0.1%
 
1217.731< 0.1%
 
10001< 0.1%
 
865.381< 0.1%
 
779.91< 0.1%
 

SchoolDistrict
Categorical

HIGH CORRELATION
MISSING

Distinct count13
Unique (%)< 0.1%
Missing7915
Missing (%)5.8%
Memory size1.0 MiB
Plano ISD
50384
Frisco ISD
35439
McKinney ISD
31795
Prosper ISD
 
6688
Lovejoy ISD
 
1834
Other values (8)
 
1288
ValueCountFrequency (%) 
Plano ISD5038437.2%
 
Frisco ISD3543926.2%
 
McKinney ISD3179523.5%
 
Prosper ISD66884.9%
 
Lovejoy ISD18341.4%
 
Allen ISD7590.6%
 
Lewisville ISD3020.2%
 
Melissa ISD1400.1%
 
Princeton ISD47< 0.1%
 
Celina ISD33< 0.1%
 
Other values (3)7< 0.1%
 
(Missing)79155.8%
 

Length

Max length14
Median length10
Mean length9.756707033
Min length3

SellerType
Categorical

Distinct count2
Unique (%)< 0.1%
Missing978
Missing (%)0.7%
Memory size1.0 MiB
Individual(s)
124987
Lender/REO
 
9378
ValueCountFrequency (%) 
Individual(s)12498792.3%
 
Lender/REO93786.9%
 
(Missing)9780.7%
 

Length

Max length13
Median length13
Mean length12.7198673
Min length3

SeniorHighSchoolName
Categorical

HIGH CORRELATION
MISSING

Distinct count20
Unique (%)< 0.1%
Missing85101
Missing (%)62.9%
Memory size1.0 MiB
Plano Senior
19599
Planoeast
9819
Planowest
9284
Plano West
5410
Plano East
4238
Other values (15)
 
1892
ValueCountFrequency (%) 
Plano Senior1959914.5%
 
Planoeast98197.3%
 
Planowest92846.9%
 
Plano West54104.0%
 
Plano East42383.1%
 
Plano Sr17101.3%
 
Centennial720.1%
 
Frisco65< 0.1%
 
Plano12< 0.1%
 
Jasper7< 0.1%
 
Other values (10)26< 0.1%
 
(Missing)8510162.9%
 

Length

Max length12
Median length3
Mean length5.718552123
Min length3

SqFtTotal
Real number (ℝ≥0)

Distinct count5663
Unique (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2623.4577924236937
Minimum0.0
Maximum17306.0
Zeros36
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1388
Q11892
median2413
Q33186
95-th percentile4406
Maximum17306
Range17306
Interquartile range (IQR)1294

Descriptive statistics

Standard deviation1019.422058
Coefficient of variation (CV)0.3885795535
Kurtosis5.887842745
Mean2623.457792
Median Absolute Deviation (MAD)611
Skewness1.491585428
Sum355066648
Variance1039221.332
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
19302480.2%
 
20982170.2%
 
17852130.2%
 
16592070.2%
 
18682040.2%
 
15352020.1%
 
22971970.1%
 
18481960.1%
 
18601950.1%
 
20761920.1%
 
Other values (5653)13327298.5%
 
ValueCountFrequency (%) 
036< 0.1%
 
491< 0.1%
 
2191< 0.1%
 
5201< 0.1%
 
5251< 0.1%
 
ValueCountFrequency (%) 
173061< 0.1%
 
150421< 0.1%
 
150002< 0.1%
 
149621< 0.1%
 
149191< 0.1%
 

StreetDirPrefix
Categorical

HIGH CORRELATION
MISSING

Distinct count7
Unique (%)0.2%
Missing131949
Missing (%)97.5%
Memory size1.0 MiB
N
1037
W
970
S
829
E
544
NW
 
6
Other values (2)
 
8
ValueCountFrequency (%) 
N10370.8%
 
W9700.7%
 
S8290.6%
 
E5440.4%
 
NW6< 0.1%
 
NE5< 0.1%
 
SW3< 0.1%
 
(Missing)13194997.5%
 

Length

Max length3
Median length3
Mean length2.949949388
Min length1

StreetDirSuffix
Categorical

HIGH CORRELATION
MISSING

Distinct count8
Unique (%)1.7%
Missing134859
Missing (%)99.6%
Memory size1.0 MiB
N
159
S
143
W
103
E
73
NW
 
3
Other values (3)
 
3
ValueCountFrequency (%) 
N1590.1%
 
S1430.1%
 
W1030.1%
 
E730.1%
 
NW3< 0.1%
 
NE1< 0.1%
 
SW1< 0.1%
 
SE1< 0.1%
 
(Missing)13485999.6%
 

Length

Max length3
Median length3
Mean length2.992892133
Min length1

StreetName
Categorical

HIGH CARDINALITY

Distinct count15169
Unique (%)11.2%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
Park
 
288
Virginia Hills
 
273
Preston
 
249
Cross Bend
 
220
Hickory
 
191
Other values (15164)
134122
ValueCountFrequency (%) 
Park2880.2%
 
Virginia Hills2730.2%
 
Preston2490.2%
 
Cross Bend2200.2%
 
Hickory1910.1%
 
Spring Creek1730.1%
 
14th1490.1%
 
Teakwood1460.1%
 
Mason1310.1%
 
Scenic Ranch1290.1%
 
Other values (15159)13339498.6%
 

Length

Max length20
Median length9
Mean length8.910006428
Min length1

StreetNumber
Categorical

HIGH CARDINALITY

Distinct count10356
Unique (%)7.7%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
2601
 
333
575
 
296
3801
 
284
3101
 
282
2204
 
267
Other values (10351)
133881
ValueCountFrequency (%) 
26013330.2%
 
5752960.2%
 
38012840.2%
 
31012820.2%
 
22042670.2%
 
24002480.2%
 
84002460.2%
 
25242300.2%
 
22002280.2%
 
25002120.2%
 
Other values (10346)13271798.1%
 

Length

Max length9
Median length4
Mean length3.99199072
Min length1

StreetSuffix
Categorical

MISSING

Distinct count44
Unique (%)< 0.1%
Missing13676
Missing (%)10.1%
Memory size1.0 MiB
Drive
65002
Lane
20263
Court
 
8598
Trail
 
5685
Street
 
4757
Other values (39)
17362
ValueCountFrequency (%) 
Drive6500248.0%
 
Lane2026315.0%
 
Court85986.4%
 
Trail56854.2%
 
Street47573.5%
 
Road46963.5%
 
Circle33182.5%
 
Way33002.4%
 
Place19421.4%
 
Avenue12610.9%
 
Other values (34)28452.1%
 
(Missing)1367610.1%
 

Length

Max length9
Median length5
Mean length4.670984092
Min length3

ArchitecturalStyle
Categorical

HIGH CARDINALITY
MISSING

Distinct count264
Unique (%)0.2%
Missing10952
Missing (%)8.1%
Memory size1.0 MiB
Traditional
113275
Ranch
 
2336
Contemporary/Modern
 
1803
Ranch, Traditional
 
1242
Mediterranean
 
752
Other values (259)
 
4983
ValueCountFrequency (%) 
Traditional11327583.7%
 
Ranch23361.7%
 
Contemporary/Modern18031.3%
 
Ranch, Traditional12420.9%
 
Mediterranean7520.6%
 
Other5340.4%
 
Contemporary/Modern, Traditional4480.3%
 
French2890.2%
 
Craftsman2870.2%
 
French, Traditional2670.2%
 
Other values (254)31582.3%
 
(Missing)109528.1%
 

Length

Max length98
Median length11
Mean length10.66080255
Min length3

TaxLegalDescription
Categorical

HIGH CARDINALITY
MISSING

Distinct count61882
Unique (%)56.5%
Missing25819
Missing (%)19.1%
Memory size1.0 MiB
SPRING CREEK PARKWAY ESTATES W
 
217
Spring Creek Parkway Estates W
 
137
WELLINGTON AT PRESTON MEADOWS
 
131
LAKES OF PRESTON VINEYARDS VIL
 
113
VILLAGES OF WHITE ROCK CREEK #
 
106
Other values (61877)
108820
ValueCountFrequency (%) 
SPRING CREEK PARKWAY ESTATES W2170.2%
 
Spring Creek Parkway Estates W1370.1%
 
WELLINGTON AT PRESTON MEADOWS1310.1%
 
LAKES OF PRESTON VINEYARDS VIL1130.1%
 
VILLAGES OF WHITE ROCK CREEK #1060.1%
 
Wellington At Preston Meadows910.1%
 
VALOR POINTE - THE RESERVE AT WESTRIDGE PHASE880.1%
 
Winsor Meadows At Westridge #0870.1%
 
BILTMORE SWIM & RACQUET CLUB #770.1%
 
PLANTATION RESORT AUGUSTA FARM750.1%
 
Other values (61872)10840280.1%
 
(Missing)2581919.1%
 

Length

Max length50
Median length30
Mean length27.52241342
Min length1

YearBuilt
Real number (ℝ≥0)

SKEWED

Distinct count: 142
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1994.8070605794167
Minimum: 0.0
Maximum: 9999.0
Zeros: 9
Zeros (%): < 0.1%
Memory size: 1.0 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1972
Q1: 1988
Median: 1997
Q3: 2003
95-th percentile: 2010
Maximum: 9999
Range: 9999
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 75.08899798
Coefficient of variation (CV): 0.03764223592
Kurtosis: 10524.21113
Mean: 1994.807061
Median Absolute Deviation (MAD): 7
Skewness: 97.18739673
Sum: 269983172
Variance: 5638.357618
Histogram with fixed size bins (bins=10)

Value                 Count    Frequency (%)
2005                  6400     4.7%
1998                  6288     4.6%
2001                  6190     4.6%
2000                  6144     4.5%
1999                  6067     4.5%
2006                  5978     4.4%
2004                  5946     4.4%
1997                  5649     4.2%
2003                  5329     3.9%
1996                  5266     3.9%
Other values (132)    76086    56.2%

Value    Count    Frequency (%)
0        9        < 0.1%
1857     3        < 0.1%
1860     2        < 0.1%
1876     2        < 0.1%
1877     1        < 0.1%

Value    Count    Frequency (%)
9999     11       < 0.1%
2020     1        < 0.1%
2019     11       < 0.1%
2018     55       < 0.1%
2017     190      0.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
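The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code pandas-profiling runs; the function name `pearson_r` is hypothetical, and the inputs are the ListPrice and ClosePrice values from the first four sample rows of this report.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    # Undefined (division by zero) when either variable is constant.
    return cov / (std_x * std_y)

# ListPrice and ClosePrice from the first four sample rows.
list_price = [230000.0, 400000.0, 530200.0, 1900000.0]
close_price = [150000.0, 400000.0, 555000.0, 1490000.0]
r = pearson_r(list_price, close_price)

# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([p / 1000 + 5 for p in list_price], close_price)
print(r, r_rescaled)
```

The same divisor n is used for the covariance and both standard deviations, so it cancels; using n - 1 throughout would give the same r.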

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
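As a rough sketch of this definition, the values can be replaced by their ranks and Pearson's formula applied to the ranks. The helper names below are hypothetical, and tied values are assumed to receive the average of their positions, which is a common convention but is not stated in this report.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship (y = x**3), made up for illustration.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
print(spearman_rho(x, y), pearson_r(x, y))
```

Because y increases whenever x does, the ranks agree exactly and ρ reaches its maximum, while Pearson's r on the raw values stays below 1.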

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
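The pair counting can be sketched directly. The formula as worded here corresponds to the variant usually called tau-a; statistical libraries often report tie-adjusted variants such as tau-b, so results can differ when values are tied. The function name `kendall_tau_a` is hypothetical.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:      # both variables move in the same direction
            concordant += 1
        elif product < 0:    # the variables move in opposite directions
            discordant += 1
        # product == 0 is a tie and counts as neither
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Made-up data: one swapped pair out of six.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 - 1) / 6.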

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013, which can be found here.
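For orientation, the bias-corrected estimator can be sketched as follows. This is an illustrative implementation of the correction as it is commonly stated (shrinking φ² and the table dimensions by terms of order 1/(n - 1)), not the code used to produce this report. The function name and the contingency tables are made up, and every row and column is assumed to have a non-zero total.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    r = len(table)
    k = len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(row[j] for row in table) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(r_corr - 1, k_corr - 1))

independent = [[10, 20], [20, 40]]   # rows are proportional: no association
perfect = [[50, 0], [0, 50]]         # each row maps to exactly one column
print(cramers_v_corrected(independent), cramers_v_corrected(perfect))
```

The two tables sit at the ends of the scale: proportional rows give a coefficient of 0, and a one-to-one mapping between categories gives a coefficient of 1.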

Missing values

Sample

First rows

 | df_index | PostalCode | BathsTotal | BedsTotal | City | CloseDate | ClosePrice | CurrentPrice | DOM | ElementarySchoolName | HighSchoolName | AssociationType | ListPrice | LotSize | MiddleSchoolName | MLSNumber | NumberOfDiningAreas | NumberOfLivingAreas | NumberOfStories | Occupancy | OriginalListPrice | ParkingSpacesGarage | PoolYN | PropertySubType | RATIO_ClosePrice_By_ListPrice | RATIO_ClosePrice_By_OriginalListPrice | RATIO_CurrentPrice_By_SQFT | SchoolDistrict | SellerType | SeniorHighSchoolName | SqFtTotal | StreetDirPrefix | StreetDirSuffix | StreetName | StreetNumber | StreetSuffix | ArchitecturalStyle | TaxLegalDescription | YearBuilt
0 | 0 | 75071 | 1.0 | 2.0 | McKinney | 10/3/2014 | 150000.0 | 150000.0 | 1649.0 | John A Baker | Prosper | None | 230000.0 | 1 Acre to 2.99 Acres | Rogers | 11363157.0 | 1.0 | 1.0 | 1.0 | Owner | 274900.0 | 2.0 | False | RES-Farm/Ranch | 0.65217 | 0.54565 | 151.82 | Prosper ISD | Individual(s) | NaN | 988.0 | N | NaN | Custer | 5841 | Road | Early American | Abstract A0412 Horn, George, T | 1930.0
1 | 1 | 75071 | 3.0 | 4.0 | McKinney | 10/3/2014 | 400000.0 | 400000.0 | 1649.0 | John A Baker | Prosper | None | 400000.0 | 3 Acres to 4.99 Acres | Rogers | 11363138.0 | 2.0 | 2.0 | 1.0 | Tenant | 424900.0 | 2.0 | False | RES-Single Family | 1.00000 | 0.94140 | 152.09 | Prosper ISD | Individual(s) | NaN | 2630.0 | NaN | NaN | Custer | 5799 | Road | Traditional | NaN | 1965.0
2 | 2 | 75034 | 5.0 | 5.0 | Frisco | 7/29/2013 | 555000.0 | 555000.0 | 1324.0 | Spears | Frisco | Mandatory | 530200.0 | Less Than .5 Acre (not Zero) | Hunt | 11281961.0 | 2.0 | 3.0 | 2.0 | Owner | 800000.0 | 3.0 | True | RES-Single Family | 1.04677 | 0.69375 | 119.92 | Frisco ISD | Individual(s) | NaN | 4628.0 | NaN | NaN | Lago Vista | 5507 | Lane | Traditional | Starwood #04 Village #15, Bloc | 2004.0
3 | 3 | 75078 | 3.1 | 3.0 | Prosper | 10/17/2012 | 1490000.0 | 1490000.0 | 1243.0 | Prosper | Prosper | None | 1900000.0 | 10 Acres to 49.99 Acres | Prosper | 11207426.0 | 2.0 | 3.0 | 2.0 | NaN | 2300000.0 | 3.0 | False | RES-Farm/Ranch | 0.78421 | 0.64783 | 322.93 | Prosper ISD | Individual(s) | NaN | 4614.0 | E | NaN | Frontier | 2380 | Parkway | Victorian | W.T. Horn Survey, A-376 | 1990.0
4 | 7 | 75034 | 5.1 | 5.0 | Frisco | 12/19/2013 | 1138000.0 | 1138000.0 | 1091.0 | Smith | Frisco | Mandatory | 1190000.0 | Less Than .5 Acre (not Zero) | Staley | 11503844.0 | 2.0 | 3.0 | 2.0 | NaN | 1350000.0 | 3.0 | True | RES-Single Family | 0.95630 | 0.84296 | 217.05 | Frisco ISD | Individual(s) | NaN | 5243.0 | NaN | NaN | Briarwood | 3074 | Lane | Traditional | NaN | 2006.0
5 | 9 | 75093 | 5.2 | 6.0 | Plano | 6/28/2013 | 906500.0 | 906500.0 | 1066.0 | Huffman | Shepton | Mandatory | 998500.0 | Less Than .5 Acre (not Zero) | Renner | 11424721.0 | 2.0 | 3.0 | 2.0 | Owner | 1150000.0 | 4.0 | True | RES-Single Family | 0.90786 | 0.78826 | 148.97 | Plano ISD | Individual(s) | Planowest | 6085.0 | NaN | NaN | Cliffview | 1816 | Drive | Contemporary/Modern | Cliffs Of Gleneagles, Blk A, L | 1994.0
6 | 10 | 75075 | 2.1 | 3.0 | Plano | 3/17/2014 | 116994.0 | 116994.0 | 1047.0 | Saigling | Vines | Mandatory | 117000.0 | Less Than .5 Acre (not Zero) | Haggard | 11566790.0 | 1.0 | 2.0 | 2.0 | Owner | 137500.0 | 2.0 | False | RES-Townhouse | 0.99995 | 0.85087 | 66.40 | Plano ISD | Individual(s) | Plano Senior | 1762.0 | NaN | NaN | Devonshire | 3225 | Drive | Contemporary/Modern, French, Traditional | Cobblestone Townhome Community | 1987.0
7 | 16 | 75070 | 3.2 | 3.0 | McKinney | 12/21/2015 | 870000.0 | 870000.0 | 950.0 | Comstock | Liberty | Mandatory | 899000.0 | Condo/Townhome Lot | Scoggins | 11933761.0 | 2.0 | 2.0 | 2.0 | Vacant | 1045000.0 | 2.0 | False | RES-Condo | 0.96774 | 0.83254 | 203.99 | Frisco ISD | Individual(s) | NaN | 4265.0 | NaN | NaN | Settlement | 5724 | Way | Traditional | RESIDENCES AT THE GRAND LODGE | 2012.0
8 | 20 | 75078 | 2.1 | 3.0 | Prosper | 9/17/2014 | 535000.0 | 535000.0 | 883.0 | Judy Rucker | Prosper | None | 594900.0 | 10 Acres to 49.99 Acres | Reynolds | 11756658.0 | 2.0 | 1.0 | 2.0 | Owner | 722775.0 | 2.0 | False | RES-Single Family | 0.89931 | 0.74020 | 248.84 | Prosper ISD | Individual(s) | NaN | 2150.0 | W | W | Prosper | 2076 | Trail | Traditional | ABS A0147 COLLIN COUNTY SCHOOL | 1987.0
9 | 23 | 75069 | 3.0 | 3.0 | McKinney | 12/18/2013 | 250000.0 | 250000.0 | 856.0 | Malvern | Mckinney | Mandatory | 269000.0 | Less Than .5 Acre (not Zero) | Dr Jack Cockrill | 11625707.0 | 2.0 | 2.0 | 2.0 | Vacant | 250000.0 | 2.0 | False | RES-Single Family | 0.92937 | 1.00000 | 99.88 | McKinney ISD | Individual(s) | NaN | 2503.0 | NaN | NaN | Preservation | 700 | Lane | Traditional | Chapel Hill #01b, Blk C, Lot 1 | 2008.0

Last rows

 | df_index | PostalCode | BathsTotal | BedsTotal | City | CloseDate | ClosePrice | CurrentPrice | DOM | ElementarySchoolName | HighSchoolName | AssociationType | ListPrice | LotSize | MiddleSchoolName | MLSNumber | NumberOfDiningAreas | NumberOfLivingAreas | NumberOfStories | Occupancy | OriginalListPrice | ParkingSpacesGarage | PoolYN | PropertySubType | RATIO_ClosePrice_By_ListPrice | RATIO_ClosePrice_By_OriginalListPrice | RATIO_CurrentPrice_By_SQFT | SchoolDistrict | SellerType | SeniorHighSchoolName | SqFtTotal | StreetDirPrefix | StreetDirSuffix | StreetName | StreetNumber | StreetSuffix | ArchitecturalStyle | TaxLegalDescription | YearBuilt
135333 | 213166 | 75070 | 3.1 | 4.0 | McKinney | 12/5/2014 | 264900.0 | 264900.0 | NaN | Bennett | Mckinney Boyd | Mandatory | 264900.0 | Less Than .5 Acre (not Zero) | Dowell | 13050048.0 | 2.0 | 3.0 | 2.0 | Vacant | 264900.0 | 2.0 | False | RES-Single Family | 1.00000 | 1.00000 | 86.57 | McKinney ISD | Lender/REO | NaN | 3060.0 | NaN | NaN | Trinity | 2401 | Lane | NaN | FOUNTAINVIEW #3 (CMC), BLK E, LOT 39 | 2006.0
135334 | 213167 | 75070 | 3.1 | 4.0 | McKinney | 12/5/2014 | 264900.0 | 264900.0 | NaN | Bennett | Mckinney Boyd | Mandatory | 264900.0 | Less Than .5 Acre (not Zero) | Dowell | 13050048.0 | 2.0 | 3.0 | 2.0 | Vacant | 264900.0 | 2.0 | False | RES-Single Family | 1.00000 | 1.00000 | 86.57 | McKinney ISD | Lender/REO | NaN | 3060.0 | NaN | NaN | Trinity | 2401 | Lane | NaN | FOUNTAINVIEW #3 (CMC), BLK E, LOT 39 | 2006.0
135335 | 213168 | 75071 | 2.0 | 4.0 | McKinney | 12/18/2014 | 199900.0 | 199900.0 | NaN | Wilmeth | Mckinneyno | Mandatory | 199900.0 | Less Than .5 Acre (not Zero) | Dr Jack Cockrill | 13056859.0 | 1.0 | 1.0 | 1.0 | Owner | 199900.0 | 2.0 | False | RES-Single Family | 1.00000 | 1.00000 | 107.24 | McKinney ISD | Individual(s) | NaN | 1864.0 | NaN | NaN | Juno Springs | 7720 | Way | NaN | VIRGINIA PARKLANDS (CMC), BLK B, LOT 14 | 2005.0
135336 | 213169 | 75071 | 2.0 | 4.0 | McKinney | 12/18/2014 | 199900.0 | 199900.0 | NaN | Wilmeth | Mckinneyno | Mandatory | 199900.0 | Less Than .5 Acre (not Zero) | Dr Jack Cockrill | 13056859.0 | 1.0 | 1.0 | 1.0 | Owner | 199900.0 | 2.0 | False | RES-Single Family | 1.00000 | 1.00000 | 107.24 | McKinney ISD | Individual(s) | NaN | 1864.0 | NaN | NaN | Juno Springs | 7720 | Way | NaN | VIRGINIA PARKLANDS (CMC), BLK B, LOT 14 | 2005.0
135337 | 213171 | 75075 | 2.0 | 4.0 | Plano | 8/21/2014 | 235000.0 | 235000.0 | NaN | Davis | Vines | None | 235000.0 | Less Than .5 Acre (not Zero) | Haggard | 13008187.0 | 2.0 | 1.0 | 1.0 | NaN | 235000.0 | 2.0 | True | RES-Single Family | 1.00000 | 1.00000 | 102.71 | Plano ISD | Individual(s) | Plano Senior | 2288.0 | NaN | NaN | Cedar Elm | 2625 | Lane | NaN | TIMBERCREEK ESTATES (CPL), BLK F, LOT 29 | 1972.0
135338 | 213172 | 75078 | 3.0 | 5.0 | Prosper | 2/16/2015 | 327000.0 | 327000.0 | NaN | Cynthia A Cockrell | Prosper | Mandatory | 330000.0 | Less Than .5 Acre (not Zero) | Rogers | 13086217.0 | 2.0 | 3.0 | 2.0 | NaN | 330000.0 | 2.0 | False | RES-Single Family | 0.99091 | 0.99091 | 100.49 | Prosper ISD | Individual(s) | NaN | 3254.0 | NaN | NaN | Crescent Valley | 1440 | Drive | NaN | CEDAR RIDGE ESTATES (CPR), BLK B, LOT 9 | 2011.0
135339 | 213173 | 75078 | 3.0 | 5.0 | Prosper | 2/16/2015 | 327000.0 | 327000.0 | NaN | Cynthia A Cockrell | Prosper | Mandatory | 330000.0 | Less Than .5 Acre (not Zero) | Rogers | 13086217.0 | 2.0 | 3.0 | 2.0 | NaN | 330000.0 | 2.0 | False | RES-Single Family | 0.99091 | 0.99091 | 100.49 | Prosper ISD | Individual(s) | NaN | 3254.0 | NaN | NaN | Crescent Valley | 1440 | Drive | NaN | CEDAR RIDGE ESTATES (CPR), BLK B, LOT 9 | 2011.0
135340 | 213176 | 75093 | 2.0 | 4.0 | Plano | 3/19/2015 | 145000.0 | 145000.0 | NaN | NaN | NaN | None | 155000.0 | Less Than .5 Acre (not Zero) | NaN | 13107447.0 | 1.0 | 1.0 | 1.0 | Owner | 155000.0 | 2.0 | False | RES-Single Family | 0.93548 | 0.93548 | 79.32 | Plano ISD | Individual(s) | NaN | 1828.0 | NaN | NaN | Birdsong | 4400 | Lane | NaN | PRESTON COVE (CPL), BLK D, LOT 1 | 1980.0
135341 | 213177 | 75093 | 2.0 | 4.0 | Plano | 3/19/2015 | 145000.0 | 145000.0 | NaN | NaN | NaN | None | 155000.0 | Less Than .5 Acre (not Zero) | NaN | 13107447.0 | 1.0 | 1.0 | 1.0 | Owner | 155000.0 | 2.0 | False | RES-Single Family | 0.93548 | 0.93548 | 79.32 | Plano ISD | Individual(s) | NaN | 1828.0 | NaN | NaN | Birdsong | 4400 | Lane | NaN | PRESTON COVE (CPL), BLK D, LOT 1 | 1980.0
135342 | 213178 | 75093 | 2.0 | 3.0 | Plano | 10/30/2014 | 242500.0 | 242500.0 | NaN | Barksdale | Shepton | Mandatory | 259900.0 | Less Than .5 Acre (not Zero) | Renner | 13024976.0 | 1.0 | 1.0 | 1.0 | Vacant | 259900.0 | 2.0 | False | RES-Single Family | 0.93305 | 0.93305 | 131.87 | Plano ISD | Individual(s) | Plano West | 1839.0 | W | NaN | Plano | 5161 | Parkway | Traditional | OLD SHEPARD PLACE #2 3 & 4 (CPL), BLK C, LOT 9 | 1985.0